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Abstract  :In  this  paper,  we  investigate  a  scheme  for  approximating  a  two-person  zero-sum 
game  G  of  incomplete  information  by  means  of  a  natural  system  of  its  finite 
subgames.  The  main  question  is  :  For  large  m  and  n,  is  an  optimal  strategy  for  Gmn 
necessarily  an  £ -optimal  strategy  for  G?  fS'UjOrlS  .  .  . 
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Section  1.  Introduction. 

To  formalize  our  idea  of  approximating  a  two-person  zero-sum  game  of  incomplete 
information  by  its  subgames,  we  introduce  what  we  shall  call  a  game  structure.  A  game 
structure  is  a  system  of  the  form  (G,U,Fm,Cn)nun«l  •  Here  Q  =  (Q.B.P)  is  a  probability 


space,  U  =  (Ujj :  i  =  1 . M;  j  =  1 . N)  is  a  matrix  of  random  variables  on  Cl  (the  payoff 

matrix),  and  Fm  and  Gn  are  sub-c-fields  of  the  c-field  B  such  that  Fm+i  2  Fm  and 
Gn+i  a  G„.  We  put  F  =  Foo  =  the  cr-field  generated  by  y  Fm,  G  =  G«  =  the  o-field 

generated  by  U  Gn. 

For  m,n  =  1,2,...,<»,  let  Gmn  be  the  two-person,  zero-sum  game  in  which  a 
strategy  for  player  I  is  an  Fm-measurable  a:  Cl  ->  SM,  and  a  strategy  for  player  n  is  a 
Gn-measurable  p:  Q->  SN.  (Here  SM  is  the  simplex  (  xeRM  :  ?  x*  =  1,  xi  £  0  }.)  If 

player  I  plays  a  and  player  n  plays  p,  then  the  payoff  to  I  is  r(a,P)  =  E(  £Ujj  ai  Pj). 

Thus  in  the  game  Gmn.  Fm  and  Gn  embody  the  information  available  to  I  and  n, 
respectively.  If  Fm  and  Gn  are  finite,  then  Gmn  is  a  finite  approximation  to  the  game  G  = 

Goo~ 

By  standard  minimax  theorems  [Sion],  each  game  Gmn  (and  a  fortiori  G)  has  a 
saddle  point.  Let  Vmn  denote  the  value  of  the  game  Gmn  to  player  I,  and  let  V  * 

If  Fm  and  Gn  are  finite,  then  the  game  Gmn  is,  at  least  in  principle,  solvable  by 
finite  methods.  The  question  we  shall  study  is :  To  what  extent  is  an  optimal  strategy  for 
Gmn  a  useful  substitute  for  an  optimal  strategy  for  G?  An  ideal  result  along  these  lines 
would  be 


(1)  Fix  e  >  0.  Suppose  that,  for  mji  *  1,2,...,  amn  is  an  optimal  strategy 
for  I  in  Gmn-  Then,  for  all  sufficiently  large  m  and  n,  a1™  is  an  e-optimal  strategy  for  I  in 
G. 


As  we  shall  see,  ( 1 )  is,  alas,  in  general  false.  The  best  we  can  do  is  a  weaker 
version  of(l)  (Theorem  1),  and  a  special  case  of  (1)  (Theorem  2).  We  shall  state  these 


theorems  presently.  For  a  strategy  a  for  player  I  in  the  game  G,  let  Valn(a)  =  i|f  r(a,(5), 

where  (3  ranges  over  Gn-measurable  strategies  for  II.  (Thus  if  a  is  Fm-measurable,  then 
Valn(oc)  is  the  value  to  I  of  the  strategy  a  in  the  game  Gmn.)  We  shall  write  Valo(a)  for 
Val  «o(a). 

Theorem  1.  For  m,n  =  1,2,...,  suppose  that  a™  is  an  optimal  strategy  for  player 
I  in  Gmn.  and  that  Fm  and  Gn  are  Finite  a-fields.  Then  lim  lim  Valo(otmn)  =  V. 

m— x»  n— >oo 

Moreover,  this  convergence  is  uniform  in  the  choices  a"1”  of  optimal  strategies ,  i.e., 

lim  Km  inf  Valc(ot)  =  V,  where  A(mji)  is  the  set  of  strategies  optimal  for 
m-x»  n-x*>  ae  A(m.n) 

player  I  in  Gmn- 

Theorem  2  says  that,  under  an  additional  hypothesis,  (1)  does  hold.  This 
hypothesis,  which  we  shall  call  (M),  is  a  version  of  the  "continuity  of  information" 
assumption  first  used  in  [Milgrom-Weber].  (M)  says  roughly  that  the  joint  probability  on  F 
and  G  is  absolutely  continuous  with  respect  to  the  product  probability  on  F  x  G.  A  precise 
statement  of  (M)  will  be  found  in  Sec.  2. 

Theorem  2.  Assume  (M)  holds.  If,  for  m,n  *  1,2 .  amn  is  an  optimal 

strategy  for  I  in  Gmn.  then  lim  Valo(0tmn)  *V,  uniformly  in  the  choices  a1™  of 

m-»<> 
n— >» 

optimal  strategies. 

In  Sec.  2,  we  shall  prove  all  of  these  various  claims. 

Section  2.  Results. 

We  first  present  an  example  which  shows  that  assertion  (1)  of  the  introduction 
does  not  hold  in  general. 

Example  :  A  game  structure  in  which  (1)  fails. 

Let  ft  be  the  interval  [0,1)  with  Lebesgue  measure,  M=N=2,  and  the  payoff 
.  j  ifiVy  for  ij  =  1,2  (independent  of  o).  For  m  =  1,2,..., 
let  Fm  *=  Gm  =  the  o-field  on  [0,1)  generated  by  the  partition 


{  [  (k-l)/2m  ,  k/2m ) :  k  =  l,2,...,2m  }.  Thus  T  -  G  *  the  Borel  o-field  on  £2.  It  is 
easy  to  see  that,  for  all  m  and  n,  Vmn  =  0,  and  in  Gmn  the  players  have  the  optimal 
strategies  ctj  =  02  =  Pi  =  P2=  1/2,  for  all  coe  Q. 

For  finite  m  >  1,  consider  the  game  The  strategy  given  by 

1  if  cd  e  [  (n-l)/2,  n/2  ]  ,  n  odd 
.0  otherwise 

m,m-l 

a, 

is  easily  seen  to  be  optimal  for  player  I  in  the  game  Gm>m.i,  i.e.,  Valm.i(flm.m‘1)  =  0.  On 
the  other  hand,  Valc(fflm'm'1)  *  -1;  is  a  very  poor  strategy  for  player  I  in  G.  Thus 

in  any  system  ( amn  :  m,n  =  1,2,...)  of  optimal  strategies  for  player  I  in  which 

am,fn-l  asjtin.in-l  for  allm>l,  lim  ValG(0tmn)  <  V. 

fn--x»o 
n— >*® 

We  shall  next  prove  Theorems  1  and  2.  We  first  require  a  series  of  lemmas.  Our 
first  lemma  is  a  special  case  of  Theorem  2  of  {Blackwell*  Dubins]. 

Lemma  2.1  Let  (Xk)  be  a  uniformly  bounded  sequence  of  random  variables,  and 
suppose  that  Xk  -->  X«  a.s.  as  k~>».  Then  E(Xk  I  Gk )  ->  E(X«  I G  )  a.s. 
as  k  ->» . 

Our  second  lemma  computes  Valn(a). 

Lemma  2.2  Fix  a  strategy  a  for  I  in  G,  and  fix  n  e  {1,2,...,»*}-  Define  the 
random  variable  \  by  :  %  *  the  j  €  { 1,2,...,N}  which  minimizes  E(  5  Uij  a,  I  Gn  ). 

In  case  of  a  tie,  for  definiteness,  take  the  least  such  j.  Then ,  for  all  strategies  P  for  n  in 
G«ni 

Cl)  E(?Uj$aj)  S  T(a,P)  ,  so 

(ii)  Val„(a)  -  E(  min  E(  J  Uij  aj  I  Gn )  )  . 

Proof :  (i)  immediately  implies  (ii),  so  we  prove  (i).  Let  p  be  a  strategy  for  II  in 
G»n,  that  is,  a  Gn  -  measurable  P :  O  -->  SN .  Then,  since  $  pj  «  1,  we  have 


E(  X  Uj£  Ctj  I  On  )  ^  X  Pj  E(  X  Ujj  Ctj  I  Gn  )  =  E(  X  Ujj  ctj  (3j  I  Gn  )  a.s. 

1  ’  j  1  i  lj 

since  {3  is  Gn  -measurable.  Taking  expected  values  on  both  sides  yields  (i). 

Lemma  2.3  Let  (ak)  be  a  sequence  of  strategies  for  I  in  G,  and  suppose  that 
ak  -->  a  a.s.  as  Then 

(i)  for  fixed  n  =  1,2,..., Valn(ak)-->  Valn(a)  as  k and 

(ii)  Valk(ak)  -->  Valc(ct)  as  k  --><*>. 

Proof:  First  note  that  applying  (ii)  in  a  system  where  Gn  =  Gn+i  =  ...  =  Gc» 

Lr 

yields  (i),  so  (i)  is  a  special  case  of  (ii).  To  prove  (ii),  let  X*  =  X  Uij  ai  in  lemma  2.1; 

then  we  have  E(  X  U»  a*  I  Gv  )  -->  E(  X  Uji  ail  G  )  a.s.  as  k  -->  ««,  so  by 

i  *  1  i 

dominated  convergence  E(  min  E(  X  Un  a*  I  Gk ) )  -->  E(  min  E(  X  Ujj  aji  G  ) ).  By 

j  '  1  1  J  » 

lemma  2.2(ii),  we  are  done. 


Lemma  2.4  (i)  For  m,n  =  1,2,...,®®,  Hm  Vmn  *V.cn  and  lim  Vmn  «=  Vmoo . 

m— >«<>  n->» 

(ii)  Hm  Vmn  *  V. 

m->~ 

n->oo 

Proof :  Fix  n,  and  let  a  be  an  optimal  strategy  for  I  in  G«n.  Now  for  m  =  1,2,..., 
put  am  =  E(a  I  Fm)-  Thus  a  is  a  legal,  though  likely  not  optimal,  strategy  for  I  in  Gmn. 
We  have 

Valn(am)  S  Vmn  . 

By  lemma  2.1,  am  ->a  a.s.  By  lemma  2.3(i),  Hm  Valn(am)  =  Valn(a)  =  V**,.  By 

m~>~ 

the  inequality  direcdy  above,  we  infer  Hm^  Vmn  *  V«r».  By  symmetry,  we  also  have 

fen  Vmn  =  Vm««  for  all  m.  This  proves  (i).  Finally,  it  is  easy  to  see  that 
n-»» 

Vm«.  ^  Vmn  £  V  **,.  Qaim  (ii)  now  follows  by  letting  m,  n  -->•*. 

We  are  now  ready  to  prove  Theorem  1. 

Proof  of  Theorem  1 :  Suppose  that  a™®  is  an  optimal  strategy  for  player  I  in  G,  for 

m,n  *  1,2 .  We  claim  that,  for  m  *  1,2,...,  lim  ValG(amn)  =  Vm« .  To  prove  this, 

n--x*> 

fix  m.  We  shall  show  that  every  subsequence  of  the  sequence  ValG(otm  )  has  in  turn  a 
subsequence  which  converges  to  Vm«<>.  Indeed,  since  each  a™1  is  Fm  -  measurable,  Fm 
being  a  finite  o-field,  by  the  Bolzano- Weierstrass  theorem  every  subsequence  of  am'  has 


a  (pointwise)  convergent  subsequence;  thus  we  may  assume  that  amn  -->  am  as  n  -->  «©. 
By  lemma  2.3(ii),  then,  Val^a™)  -->  ValQ(am)  asn-->oo.  By  hypothesis, 
Valn(amn)  =  Vmn.  so  in  fact  Vmn  — >  ValG(cxm)  as  n  — >  <».  Thus  by  lemma  2.4(i), 
ValG(ctm)  =  Vm<K>.  On  the  other  hand,  since  a1™1  — >  am  as  n  — >  «>,  by  2.3(i)  we  also 
have  ValGCa™) -->  ValG(txm)  *  Vmoo.  This  proves  our  claim. 

Now  by  another  use  of  lemma  2.3(i),  lim  Hm  ValG(otmn)  =  V.  To  prove  the 

m--xx>  n— >o» 

"moreover"  clause  in  Theorem  1,  let  (emn)  be  a  sequence  of  numbers  which  converges  to 

0  as  m,n  -->  <*>.  For  all  m,n,  note  that  there  exists  amn  e  A(m,n)  such  that 

ValG(amn)  -  emn  ^  inf  Valo(ot)  £  Valc(otrnn)  .  The  "moreover"  clause 
ole  A(m.n) 

follows  at  once. 

We  now  consider  Theorem  2.  We  must  first  discuss  hypothesis  (M). 

F  x  G  is  the  cr-field  on  ft  x  fi  generated  by  sets  of  the  form  S  x  T,  where  S  e  F  and 
T  e  G.  Let  Q  and  R  be  the  probability  measures  on  (Q  x  £},F  x  G  )  defiined  by 

Q(A)  =  P(  j  £0 :  (co.co)  e  A  } )  and 

R(A)  *  J  J (  (oj,ti)€  A }  1  P(dto) P(dq)  for  A eF  x G  . 

We  now  state  assumption  (M). 

(M)  Q  is  absolutely  continuous  with  respect  to  R,  that  is,  for  all  A  e  F  x  G,  if  R(A)=0, 
then  Q(A)=0. 

Assumption  (M)  is  a  version  of  a  hypothesis  introduced  in  [Milgrom- Weber],  It  is 
easy  to  see  that  (M)  is  satisfied  either  if  F  and  G  are  independent  (in  which  case  Q=R),  or 
if  either  For  G  is  atomic. 

Lemma  2.5  Suppose  (M)  is  satisfied.  Then  if  (Xfc)  is  a  uniformly  bounded 
sequence  of  F-mcasurable  random  variables  which  converges  weakly  to  and  if  Z  is 
any  bounded  random  variable,  then 

i)  E(XkZI  G)  ->  E(X_ZI  G)  a.s.  and 

ii)  E(XkZI  Gk)->  E(X«ZI  G)  a.s.  as  k  . 

Proof:  First  note  that,  since  E(  E(  Xk  Z I G )  I  Gk)  s  E(  Xk  Z I  Gk),  by  lemma 
2.1, 0)  implies  (ii) .  Next,  note  that  we  may  assume  without  loss  of  generality  that  Z  is 
measurable  in  F  v  G  (the  -field  generated  by  F  U  G ) .  This  is  because  E(  Xk  Z I  G  )  = 


E(  XkE(  Z I F  v  G)  1 0  ),  so  we  may  replace  Z  by  E(  Z I F  v  G)  if  necessary.  We  shall 
therefore  prove  (i),  assuming  that  Z  is  F  v  G  -  measurable. 

Since  Z  is  F  v  G  *  measurable,  there  exist  a  bounded  F-measurable  random 
variable  a  bounded  G-measurable  random  variable  and  a  bounded,  Borel 
measurable  function  f :  R  x  R  -->  R  such  that  Z  «  f$,$). 

By  (M)  and  the  Radon- Nikodym  theorem,  there  exists  a  bounded  function 
g:flxfl->R  such  that,  for  all  A  £  FxG,  P({o> :  (co.co)  e  A  })  =  JA  g(co,Ti)P(dco)P(d,n) . 

It  follows  by  standard  methods  that,  for  any  vector  X  of  F-roeasurable  random  variables, 
any  vector  Y  of  G-measurable  random  variables,  and  any  Bore! -measurable  h  :  R2  -->  R , 
we  have  . 

(*)  E(  h(X,Y)  I  G )  (n)  =  Jft  h(X(a>)  ,Y(t» )  g(®,Tl)  P(dco)  a.s.  [t\]  . 

Now  by  (*)  we  have 

E(Xk Z I G)  On)  =  Xk(03)  f(^(co),^Cn))  g(®,n)  P(dco)  a.s.[q]  . 

Since,  by  assumption,  Xk  ->  X«  weakly,  we  have  E(Xk  Z I G)  (ri)  ->  E(X«  Z I G)  On) 
a.s.,  as  desired. 

In  exact  analogy  to  lemma  2.3,  we  have 

Lemma  2.6  Assume  that  (M)  holds.  If  (ak)  is  a  sequence  of  strategies  for  I  in  G 
which  converges  weakly  to  a  strategy  a,  then 

(i)  for  fixed  n  =  1,2,...,  «>,  Valn(ak)  -->  Valn(a) ,  and 

(ii)  Valk(ak)  ->  ValG(a)  as  k->  «» . 

Proof  of  Theorem  2:  Suppose  that  amn  is  an  optimal  strategy  for  player  I  in  Gmn, 
for  all  finite  m  and  n.  We  shall  prove  that  every  sequence  (mk4iit)  of  pairs  of  integers 
such  that  mk  -> «»  and  nk  ->  <»  has  a  subsequence  (m'k^t'k)  such  that 
Valcfot10  kA  k)  — >  V  as  k  — >  <•.  To  conserve  notation,  let  us  write  ak  for  an'k’nk  and  Vk 
for  Vm’bn’k.  By  weak  compactness,  we  may  choose  the  sequence  (ak)  to  converge 
weakly  to  a  strategy  a  as  k  ->*>.  By  lemma  2.6(ii),  Valn*k(ak)  -->  Valo(ot)  as 
k->  ®o.  By  assumption,  Valn^fa^  *  Vk,  and  by  lemma  2.4(ii),  Vk  ~>  V;  thus 
Valc(a)  »  V. 

On  the  other  hand,  since  (a’O  converges  weakly  to  a,  by  lemma  2.6(i), 

Valc(otk)  ->  Valc(cx)  =  V  as  k  ->  <*>.  Uniformity  follows  just  as  in  die  proof  of 
Theorem  1.  This  completes  the  proof  of  Theorem  2. 
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